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The identification of the solar-like oscillation modes, as measured by asteroseismology, is a necessary requirement in 
order to infer the physical properties of the interior of the stars. Difficulties occur when a large number of modes of 
oscillations with a low signal-to-noise ratio are observed. In those cases, it is of common use to apply a likelihood- 
ratio test to discriminate between the possible scenarios. We present here a statistical analysis of the likelihood-ratio test 
and discuss its accuracy to identify the correct modes. We use the AsteroFLAG artificial stars, simulated over a range 
of magnitude, inclination angle, and rotation rate. We show that the likelihood-ratio test is appropriate up to a certain 
magnitude (signal-to-noise ratio). 
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1 Introduction 

The recent development of space-based instruments such 
as the Convection, Rotation, and planetary Transits mis- 
sion (CoRoT ; [Michel et al. ||2008l l and the NASA's Ke- 
pler mission ( Borucki et al. |20d9|) as well as th e organized 
ground-based campaigns ( Arentoft et al. |20(j8] l are produc- 
ing a huge volume of asteroseismic observations of unprece- 
dented quality. Thus, it is now possible to correctly detect 
individual pressure (p) driven modes and to describe the os- 
cillation modes with Lorentzian profiles, as it is commonly 
done for the Sun (e.g., 'Chapli n et al. |[2006) l. However, in 
the stellar case, there is a substantial difference: while in the 
Sun, the inclination of the rotation axis and the surface ro- 
tation rate are known, in most of the stars analyzed up to 
now, these two parameters are unknown. Moreover, as these 



two parameters are highly correlated ( [Ballot et al. 2006 



Gizon & Solanki [2003 1, in order to improve the stability of 



the fits, the traditional pair-by-pair fitting methodology fol 
lowed in the Sun is changed to a global strategy in which all 
the modes are fitted at the same time along with the inclina- 
tion angle and one value for the rotational splitting. The first 
application of this strategy was introduced in asteroseismol- 
ogy by Appourchaux et al7] ( [200 8 ) and it has been since suc- 
cesfully apphed to the CoRoT solar-like targets in which the 
signal-to noise ratio was high enough (Barba n et al. |20(j9 



[Deheuvels et al. |[2010| [Garcia efal. ||2009| ). Indeed, some 
of the measured CoRoT stars were too faint to perform such 
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kind of analyses ( [Mathur et al. |2010a[|Mosser et al. [2009 1. 
Moreover, the global fitting has also been applied to the first 
Kepler solar-like targets (I Chaplin et al.[[20T0l l, as well as 
the latest ground-based observational campaign of Procyon 
( [Bedding et al. |2010b| l. 



Everything would be great if we were able to identify 
the individual modes (or mode-tagging) before performing 
the peak-bagging. Indeed, for most of the observations, a 
clear identification of the modes (or of the ridges) appears 
to be a difficult task and it is then necessary to perform 
the peak- fitting using the two possible identifications. The 
possible scenarios (or taggings) are discriminated by com- 
paring a posteriori the likelihoods of the minimization, the 
highest likelihood being chosen as the correct mode identi- 



fication. Benomar et al. (2009 1 using a longer time series - 
which means a better overall signal-to-noise ratio - demon- 
strated by comparing the likehhoods that the first p-mode 



identification done by Appourchaux et al. ( 2008 1 of the 



CoRoT target HD49933 was wrong. However, we present in 
this work that the so-called likelihood-ratio test has certain 
limits depending on both the magnitude of the star and the 
length of the observations (i.e., the overall signal-to-noise 
ratio of the modes in the power spectrum). It is important to 
remember that an incorrect mode tagging would have very 
bad consequences on the inferences of the stellar properties 
(e.g., [Creevey et al.|2007|[Stello et al. 12009] ) . 
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2 Data and analysis 

We used simulated data generated within the AsteroFLAG 
team for hare-and-hounds exercises ( Chaplin et al. ||2008 



Table 1 Fundamental stellar parameters of Pancho 



Class 


Iog(3)(cm/s^) 


log(Z/X) 


Teff(K) 


Dwarf(V) 


4.3 ±0.1 


-1.4 ±0.1 


6383 ± 40 



In this work, we concentrate on one of these stars: Pancho, 
whose fundamental parameters \og{g), \og{Z/X), and Toff 
are given in Table [T] The following analysis is performed 
over a range of: 

- apparent magnitude M„: 9, 10, 11, and 12; 

- incUnation angle: 30° and 60°; 

- rotation rate: 1.5 (slow), 3.0 (medium), and 5.0 (high) 

The original simulated ^ 3-year time series (with a 60- 
second cadence) were divided into 3, 12, and 36 sub-series 
of 365, 93, and 31 days respectively. The global param- 
eters Av, Freq(min), Freq(max), i^max. ™d A^ax derived 
from 3 1 -day time series are given in Table |2] ( Mathur et al. 
|2010b[ ). A Maximum-Likelihood Estimator (MLE) global 
fitting of the / = 0, 1,2 modes using the strategy defined 
in I Appourchaux et al. |P008] | is performed on each power 
spectrum. The number of fitted modes is fixed as a function 
of the length T of the sub-series as follows: 

- 31 days: 6 and 8 large separations; 

- 93 days: 10 and 14 large separations; 

- 365 days: 14 large separations. 

We define two different mode taggings: A for the correct 
one, and B for the incorrect one. Two different peak-fitting 
approaches were used as well: one with no Bayesian con- 
straints, and one with three Bayesian constraints, defined 
as: 

- inclination angle ±15° of the simulated value; 

- rotational splitting ±1.5/iHz of the simulated value; 

- Full-Width-at-Half-Maximum (fwhm) of the high or- 
der modes > 2/iHz (to avoid fitting spikes instead of 
wider modes). 

We have run our fitting code on the GODUNOV cluster 
at CEA/SAp. It has 15 bi-processors nodes and 15 dual core 
bi -processor nodes, for a total of 90 cores connected through 
a gigabit network and running IDL. We have used typically 
15 to 30 cores for 2 to 5 days to run all the fittings. Indeed, 
we run in parallel several fits of the same time series using 
different strategies. 

3 Comparison with the input frequencies 

We compared the fitted frequencies with the input frequen- 
cies of the simulated data. For both the correct (A) and in- 
correct (b) taggings. Fig. [T] shows the resulting histograms 
of the frequency differences between FITS and INPUTS for 
the I = modes in the case of the 93-day time series (slow 
rotation and inclination angle of 60°). The percentages of 
good fits within 3(7 are indicated. 

While most of the fits fall within the 3(j limit, clear 
offsets between the fitted and input frequencies are present 



Table 2 Global parameters derived from 31 -day time se- 
ries (M„= 9 and 10) of Pancho ( |Mathuret ari2010b) 





Freq(min) 


Freq(max) 


^max 






(AtHz) 


(fiHz) 


(mHz) 


(rms ppm) 


69±1.5 


1000 


2300 


1700±50 


2.7±0.3 



when the incorrect tagging (b) is chosen (Fig.|2]i. These off- 
sets of a few /iHz are larger than 3(jf (a-f = (formal errors)) 
and tend to increase with stellar magnitude. When the cor- 
rect tagging (a) is chosen, these offsets are minimal and are 
within the 3<t / limit. 

4 Likelihood-ratio test 

We present here the reliability of the likelihood-ratio test as 
a function of the star magnitude M„ and of the length of 
the observations T. The following discussion concerns the 
AsteroFLAG star Pancho with fast rotation (5/iHz) and an 
inclination angle of 60°. For the 3 lengths of observations 
(T = 31, 93, and 365 days). Fig. [3] shows the mean and the 
median values of the likelihood ratios between the correct 
(a) and incorrect (b) mode identifications over the entire 
set of analyzed power spectra as a function of M„. The per- 
centages that the likelihood-ratio test returns the incorrect 
tagging (b) as being more likely are also represented (right 
y-axis). A clear dependence with and T is identifiable. 
For the same star and as the length of the time series in- 
creases, the likelihood-ratio test is more likely to return the 
correct answer. However, as the magnitude of the star in- 
creases, the likelihood-ratio test is not as reliable. 

The percentages that the likelihood-ratio test returns the 
incorrect tagging (green curves on Fig[3]l are: 

- 31-day time series: ^ 25% up to My = 11; 

- 93-day time series: < 25% up to M„ = 12; 

- 365-day time series: 0% up to My = 11. 

In the case of stars with slow rotation, these percentages 
get larger, with for instance, 25% chance that the likelihood- 
ratio test returns the incorrect mode identification as being 
more likely for M„ = 9 and T = 93 days. The introduction 
of Bayesian conditions can help mainly when the character- 
istics of the star make the mode identification difficult, for 
example, a star with slow rotation. 

5 Conclusions 

The identification of the oscillations modes (or mode tag- 
ging) is a necessary step before we can infer the physi- 
cal properties of the star interiors. Even when using global 
peak-fitting techniques, the mode tagging remains a difficult 
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Fig. 1 Histograms of the frequency differences between FITS and INPUTS for the I — modes in the case of the 93-day 
Pancho time series, with a slow rotation and an incUnation angle of 60°. The red solid lines correspond to the fitted Gaussian 
function to the distribution, and the dashed red lines to the mean of the distribution. The black dotted lines correspond to 
a difference (FITS - INPUTS) equal to 0, while the green dashed lines represent the 3(7 limit (cr = standard deviation of the 
gaussian distribution). The percentages of good fits within the 3a limit are also given. 



task to achieve, when for instance, a large number of oscil- 
lation modes are observed with a low signal-to-noise ratio. 
The likelihood-ratio test can help to discriminate between 
the possible scenarios (i.e. taggings) and it has been already 
successfully applied in asteroseismology with the CoRoT, 
Kepler, and Procyon observations. Nevertheless, by analyz- 
ing the AsteroFLAG artificial star Pancho, we showed in 
this work that the likelihood-ratio test has certain limits de- 
pending on both the star magnitude and the length of the 
observations. For example, for a star with a rotational split- 
ting of 5/iHz and an inclination angle of 60°, observed dur- 
ing 31 days, the likelihood-ratio test will statistically return 
the incorrect mode identification 25% of the time for a star 
magnitude up to M„ = 11. This percentage decreases as the 
length of observation increases and for time series of 365 
days, the likelihood-ratio test will return 100% of the time 
the correct identification up to M„=ll. However, for stars 
with slow rotation, the likelihood-ratio test is more likely 
to return the incorrect mode tagging. The use of Bayesian 
conditions help mostly when the star characteristics (for ex- 
ample, slow rotation) make the mode identification difficult. 

With the long-term observations that will be collected 
by the Kepler mission, we hope to have enough signal-to- 
noise ratio to unambiguously determine the correct identifi- 
cation of the modes for both solar-like stars ( jChaplin et al.| 
2010[ ) and red giants ( Bedding et al.|2010a i, as well as the 



stars in open clusters ( |Stello et al. |20ld) l. However the rel- 
ative faintness of these later stars will probably require the 
use of the likelihood-ratio test to disentangle between the 
tagging of the modes. 
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Fig. 3 Likelihood ratios between the correct (A) and in- 
correct (b) mode identifications over the entire set of ana- 
lyzed power spectra as a function of the star magnitude M„ 
and the length of observations T (from top to bottom, T = 
31, 93, and 365 days respectively) for an inclination angle 
of 60°. Both mean (red squares) and median (black dots) 
values of the likelihood ratios are shown. The percentages 
that the likelihood-ratio test returns the incorrect tagging (B) 
as being more likely are given using the right y-axis (green 
stars). 
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